Using Ensembles to Classify Compounds for Drug Discovery
Authors
Abstract
This paper introduces Signal, a novel method for classifying compound activity against a small-molecule drug target. Signal creates an ensemble, or collection, of meaningful descriptors chosen from a much larger property space. The method works with a variety of descriptor types, including fingerprints that represent four-point pharmacophores or shape descriptors. It also exploits information from both active and inactive compounds and generates predictive models suitable for analyzing high-throughput screening data. Given the fingerprints and activity data for a set of compounds, Signal proceeds in two steps. The first step, Evaluate the Descriptors, quantifies and ranks, for each descriptor in the fingerprint, the correlation between the activity of the compounds and the presence of that descriptor. The second step, Create an Ensemble Model, uses the high-ranking descriptors to build a model of activity against the biological target. For the first step, two ranking strategies were investigated: mutual information and chi-square. For the second step, two types of ensemble model were investigated: high ranking and a novel method called high ranking set cover. Of the four possible pairings, the combination of chi-square and high ranking set cover performed best on a Thrombin data set.
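The descriptor-evaluation step can be illustrated with a minimal sketch. It assumes binary fingerprints (1 = descriptor present) and binary activity labels (1 = active), scores each descriptor with the standard 2x2 chi-square statistic or with mutual information, and keeps the top-ranked descriptors. The function names, the toy data, and the cutoff `k` are hypothetical and not taken from the paper. The abstract does not say how the ensemble produces predictions or how high ranking set cover works, so the sketch stops at selecting the high-ranking descriptors.

```python
import math


def contingency(column, labels):
    """2x2 counts: (present & active, present & inactive,
    absent & active, absent & inactive)."""
    a = sum(1 for x, y in zip(column, labels) if x and y)
    b = sum(1 for x, y in zip(column, labels) if x and not y)
    c = sum(1 for x, y in zip(column, labels) if not x and y)
    d = sum(1 for x, y in zip(column, labels) if not x and not y)
    return a, b, c, d


def chi_square(column, labels):
    """Pearson chi-square statistic for a 2x2 table (no continuity correction)."""
    a, b, c, d = contingency(column, labels)
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:  # descriptor or activity is constant: no association measurable
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


def mutual_information(column, labels):
    """Mutual information (in bits) between descriptor presence and activity."""
    a, b, c, d = contingency(column, labels)
    n = a + b + c + d
    cells = (
        (a, a + b, a + c),  # present, active
        (b, a + b, b + d),  # present, inactive
        (c, c + d, a + c),  # absent, active
        (d, c + d, b + d),  # absent, inactive
    )
    mi = 0.0
    for joint, row_total, col_total in cells:
        if joint:  # empty cells contribute nothing
            mi += (joint / n) * math.log2(joint * n / (row_total * col_total))
    return mi


def rank_descriptors(fingerprints, labels, score):
    """Return descriptor indices sorted from highest to lowest score."""
    n_desc = len(fingerprints[0])
    scores = [score([fp[j] for fp in fingerprints], labels) for j in range(n_desc)]
    return sorted(range(n_desc), key=lambda j: scores[j], reverse=True)


# Toy data (hypothetical): six compounds, three descriptors.
fingerprints = [
    [1, 0, 1],
    [1, 0, 0],
    [1, 1, 1],
    [0, 1, 0],
    [0, 0, 1],
    [0, 1, 0],
]
labels = [1, 1, 1, 0, 0, 0]

ranked_chi = rank_descriptors(fingerprints, labels, chi_square)
ranked_mi = rank_descriptors(fingerprints, labels, mutual_information)
k = 1  # hypothetical cutoff for "high-ranking" descriptors
top_descriptors = ranked_chi[:k]
print(top_descriptors)
```

Both scores are symmetric: a descriptor enriched among inactive compounds ranks as highly as one enriched among actives, which fits the abstract's point that information from both classes is used.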
Related articles
Assessment of "drug-likeness" of a small library of natural products using chemoinformatics
Even though natural products have an excellent record as a source of new drugs, the advent of ultrahigh-throughput screening and large-scale combinatorial synthetic methods has caused a decline in natural products research in the pharmaceutical industry. This is due to the efficiency of generating and screening large numbers of synthetic combinatorial compounds, whereas traditional ...
Discovery of Novel Glucagon Receptor Antagonists Using Combined Pharmacophore Modeling and Docking
Glucagon and the glucagon receptor are the most important molecules controlling blood glucose concentrations. These two molecules are very important to studies of type 2 diabetic patients. In the literature, several classes of small-molecule antagonists of the human glucagon receptor have been reported. Glucagon receptor antagonists could decrease hepatic glucose output and improve glucose control in d...
Drug Discovery Acceleration Using Digital Microfluidic Biochip Architecture and Computer-aided-design Flow
A Digital Microfluidic Biochip (DMFB) offers a promising platform for medical diagnostics, DNA sequencing, Polymerase Chain Reaction (PCR), and drug discovery and development. Conventional drug discovery procedures require time-consuming and costly manual experiments with a high degree of human error and no guarantee of success. On the other hand, DMFB can be a great solution for miniaturization, int...
Journal title:
- Journal of Chemical Information and Computer Sciences
Volume 43, Issue 6
Publication date: 2003